Skip to main content

Single Microphone Blind Audio Source Separation Using EM-Kalman Filter and Short+Long Term AR Modeling

  • Conference paper
Latent Variable Analysis and Signal Separation (LVA/ICA 2010)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6365))

Abstract

Blind Source Separation (BSS) arises in a variety of fields in speech processing such as speech enhancement, speakers diarization and identification. Generally, methods for BSS consider several observations of the same recording. Single microphone analysis is the worst underdetermined case, but, it is also the more realistic one. In this article, the autoregressive structure (short term prediction) and the periodic signature (long term prediction) of voiced speech signal are modeled and a linear state space model with unknown parameters is derived. The Expectation Maximization (EM) algorithm is used to estimate these unknown parameters and therefore help source separation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. http://www.eurecom.fr/~bensaid/ICA10

  2. Cichocki, A., Thawonmas, R.: On-line algorithm for blind signal extraction of arbitrarily distributed, but temporally correlated sources using second order statistics Neural Process. Neural Process. Lett. 12(1), 91–98 (2000)

    Article  MATH  Google Scholar 

  3. Barros, A.K., Cichocki, A.: Extraction of specific signals with temporal structure. Neural Comput. 13(9), 1995–2003 (2001)

    Article  MATH  Google Scholar 

  4. Tordini, F., Piazza, F.: A semi-blind approach to the separation of real world speech mixtures. In: Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN 2002, vol. 2, pp. 1293–1298 (2002)

    Google Scholar 

  5. Smith, D., Lukasiak, J., Burnett, I.: Blind speech separation using a joint model of speech production. IEEE Signal Processing Letters 12(11), 784–787 (2005)

    Article  Google Scholar 

  6. Chu, W.C.: Speech coding algorithms-foundation and evolution of standardized coders. John Wiley and Sons, NewYork (2003)

    MATH  Google Scholar 

  7. Feder, M., Weinstein, E.: Parameter estimation of superimposed signals using the EM algorithm. IEEE Trans. Acoust., Speech, Signal Processing 36, 477–489 (1988)

    Article  MATH  Google Scholar 

  8. Gannot, S., Burshtein, D., Weinstein, E.: Iterative-batch and sequential algorithms for single microphone speech enhancement. In: ICASSP 1998, pp. 1215–1218. IEEE, Los Alamitos (1998)

    Google Scholar 

  9. Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Society B 39, 1–38 (1977)

    MathSciNet  Google Scholar 

  10. Gao, W., Tsai, S., Lehnert, J.: Diversity combining for ds/ss systems with time-varying, correlated fading branches. IEEE Transactions on Communications 51(2), 284–295 (2003)

    Article  Google Scholar 

  11. Couvreur, C., Bresler, Y.: Decomposition of a mixture of Gaussian AR processes, Acoustics, Speech, and Signal Processing. In: 1995 International Conference on ICASSP 1995, vol. 3, pp. 1605–1608 (1995)

    Google Scholar 

  12. Christensen, M., Jakobsson, A., Juang, B.H.: Multi-pitch estimation, Morgan & Claypool (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bensaid, S., Schutz, A., Slock, D.T.M. (2010). Single Microphone Blind Audio Source Separation Using EM-Kalman Filter and Short+Long Term AR Modeling. In: Vigneron, V., Zarzoso, V., Moreau, E., Gribonval, R., Vincent, E. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2010. Lecture Notes in Computer Science, vol 6365. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15995-4_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15995-4_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15994-7

  • Online ISBN: 978-3-642-15995-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics